Corpus: urd_news_2020_30K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 3887 ا-
2 2948 م-
3 2410 ک-
4 2314 ب-
5 1923 س-
Top Character Bigrams
word rank frequency n-gram
1 526 ای-
2 471 ان-
3 430 اس-
4 428 ال-
5 397 کر-
Top Character Trigrams
word rank frequency n-gram
1 125 اسٹ-
2 121 اور-
3 118 کار-
4 89 مار-
5 87 پاک-
Top Character 4-Grams
word rank frequency n-gram
1 72 پاکس-
2 50 محمد-
3 45 اسٹی-
4 45 عبدا-
5 45 بھار-
Top Character 5-Grams
word rank frequency n-gram
1 71 پاکست-
2 46 محمد-
3 45 عبدال-
4 43 بھارت-
5 32 اسلام-
338 msec needed at 2021-07-19 22:07